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METHOD OF UPDATING INTERESTS 

The present invention relates to a method of updating interests, particularly 
updating interests of a user. 
5 In the current information rich climate, a considerable amount of attention is 

being paid to develop improved methods of information retrieval. In particular, the 
field of agent technology is heavily involved in developing agents for searching, 
summarisation, filtering and presentation of information e.g. Davies, Weeks & Revett, 
1997 ("Information Agents for the WWW" in Software agents and soft computing, 

10 Eds H. Nwana, N. Azarmi, Berlin Springer-Verlag). Most, if not all, of these rely on 
the agent having some knowledge of the user. The inclusion of user information is 
rapidly becoming a key area, not only for agent technology, but also for the Internet 
in general, as demonstrated by the recently proposed Open Profiling Standard Dunn, 
Gwertzman, Layman & Partovi 1997 ("Privacy and profiling on the web" Technical 

15 note, WWW consortium 2 June 1997). User information is undoubtedly playing an 
ever-increasing role as digital libraries, electronic commerce, and personalised 
applications become more widespread. 

Filtering agents (such as email prioritising agents) were typically the first 
generation of agents that were concerned with managing volumes of information. 

20 Maes 1994 describes the various motivations, methods and applications of such 
agents for reducing information overload (Maes (1994) "Agents that reduce workload 
and information overload" Communications of the ACM, July). Indeed, this is perhaps 
the single most important reason that agents have succeeded within the Internet 
environment: there are simply too many resources available for any one person to 

25 consult exhaustively, and indeed if one were fortunate enough to have browsed all 
these pages, one would probably find that only a small percentage were actually of 
any interest or relevance. Search agents (such as Amalthaea) are becoming 
increasingly prominent as a means to reduce this workload, with most including some 
method of allowing the user to inform the agent of their interests and preferences. 

30 Further developments within agent technology have begun to consider previous 
interactions with a user as well as learning their interests. 

Many recent agents include some form of user profiling, such as 
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• Yenta (Foner & Crabtree, 1997 ("Multi-agent matchmaking" in Software agents 
and soft computing, Eds H. Nwana, N. Azarmi, Berlin Springer-Verlag )), 

• Syskill & Webert (Pazzani and Billsus, 1997 "Learning and revising user profiles: 
the identification of interesting web sites", Machine learning 27 (3)), 

5 • Personal WebWatcher (Mladenic, 1996 "Personal webwatcher: design and 
implementation", Technical report ljs-DP-7472, School of computer science, 
Carnegie-Mellon University, Pittsburgh USA, October.), 

• Letizia (Lieberman. 1995 "Letizia: An agent that assists web browsing", 
Proceedings, 14 th Joint International Conference on artificial intelligence (IJDAI- 

10 95), Montreal, Canada), 

• NewsSIEVE (Haneke, 1997 "Learning based filtering of text information using 
simple interest profiles", In P. Kandzia & M. Klusch, Co-operative information 
agents. Berlin: Springer-Verlag"), and 

• INSOP (Kindo etal 1997 "Adaptive personal information filtering system that 
15 organizes personal profiles automatically", Proceedings 15 th International Joint 

Conference on Artificial Intelligence (IJCAI-97) pp. 716-721 Nagoya, Japan). 
The agent of Syskill & Webert learns a user's profile and uses this to guide its 
suggestions for interesting web pages. The user rates the relevance of the 
suggestions to the user's interest and the agent employs a naive Bayesian classifier 

20 to revise the user profile accordingly. Amalthaea (Moukas, 1997 (Moukas (1997) 
"User modelling in a multiagent evolving system" Proceedings, workshop on Machine 
learning for user modelling, 6 th International Conference on User modelling, Chia 
Laguna, Sardinia)) employs a weighted keyword representation for a user*s profile, 
which is then consulted in order to query Internet search engines in order to retrieve 

25 pages of interest to a user. Amalthaea's user profile comprises a number of 
information filtering agents (IFAs), each one specialised for a particular topic. These 
IFAs can be constructed in a number of ways (e.g. through analysis of a user's 
hotlist of favourite web pages; through observation of the user's interaction with 
their browser (using page access history logs) etc.) Furthermore the IFAs can evolve 

30 over time, so that they can adapt more closely to the user as the user interacts with 
Amalthaea over time. 

Many web sites collect information about a user in order to a) keep track of 
demographic information, b) provide personalised services and information from the 
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site. This technique is becoming more important for sites to be able to maintain a 
loyal customer base. Indeed Firefly, Autonomy and OpenSesame! have all recently 
announced products which will enable such adaption. 

It is therefore clear that having a profile of a user is crucial in order to provide 
5 services and information of interest to the user. Although the systems described 
above use a variety of techniques and information sources, not many systems 
attempt to arrange the keywords, or identifiers in the profile, as a function of the 
context of a user. The context of a user at a moment in time largely determines the 
type and content of information that is of interest to the user at that moment in time 

10 (e.g. if a user is working then the user is likely to want to know about work related 
interests) and developments in this area could assist in reducing the volume of 
information that is presented to a user. 

The Applicant's patent EP807291 (IPD case ref A24976) teaches a 
software-based system known as the ''JASPER" agent, which stores meta- 

1 5 information, such as URL of the document and keywords indicative of the content of 
the document, relating to documents of interest to a user. JASPER also stores user 
profiles comprising keywords that indicate the interests of users in particular types of 
information, and performs collaborative filtering between users to identify users that 
may have overlapping interests (arranging users into groups). JASPER compares the 

20 meta-information between groups of users, and, if one user in group A has identified 
a document as being relevant to him, JASPER passes the meta-information relating to 
the identified document to other users in group A (e.g. via email). The user profile 
can be arranged to store keywords in categories, as a function of context (e.g. types 
of work, leisure), and JASPER can change the content of these categories in 

25 accordance with detected changes in information type viewed by the user (e.g. 
performing a comparison between keywords stored in respect of one context and 
comparing that to the keywords in documents being viewed). JASPER is thus 
concerned with identifying a set of keywords that characterise a context, for the 
purposes of information delivery and identification between groups of users. 

30 

In the following description, the terms "interest", "user profile", "information 
source", "context", "instantiated interest" are used and are defined as follows: 
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"interest" includes subject-related data such as a set of keywords and/or images 
and/or music that are representative of a subject. Commonly images and music are 
accompanied by some form of description e.g. for music "Faure Requiem, written in 
1887, the piece reflects Faure's vision of death". In this case, keywords could be 
5 extracted from the description, and used to describe an interest (e.g. for the subject 
Impressionist, Death). 

"user profile" includes one or more interests stored in respect of a user, 
"information source" includes an entity that contains information, e.g. a document, 
"context" includes representations that describe the current status of a user - e.g. 
10 state (work, play), situation (home, away) and company (colleagues). 

"instantiated interests" when the status of the user has been identified, the context 
of the user is instantiated - e.g. the user is identified to be working at home, so the 
context is instantiated to work, home. Interests are then filtered according to the 
instantiated context and these interests are referred to as instantiated interests. 

15 

According to the present invention there is provided a method of changing a 
profile representing subject matter of interest to a user. The profile includes a 
plurality of sets of subject-related data, and the method includes the steps of 

(i) suggesting a change to the profile to the user, which suggestion includes one 
20 or more selection choices representative of the suggested change, 

(ii) receiving one or more selection choices from the user, 

(iii) modifying the user profile in accordance with the or each selection choice, 

(iv) monitoring user action in respect of the modification, and, if the user action 
satisfies predetermined criteria, 

25 (v) permanently changing the profile in accordance with the or each selection 
choice. 

Preferably the monitoring step (iv), which occurs with a frequency that is 
configurable by the user, includes the steps of 
30 presenting the user with at least one further selection choice, 
receiving the or each further selection choice, 

comparing the or each further selection choice with the predetermined criteria. 
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In addition, the or each further selection choice includes affirmation of the one or 
more selection choices representative of the suggested change suggested in step (i). 

Alternatively the monitoring step (iv) can comprise counting a number of 
5 times the user accesses information relating to the selection choice, and comparing 
the said number with a threshold number. As a further alternative, the monitoring 
step (iv) can include observing patterns of user access relating to the selection 
choice, and comparing the observed pattern with one or more predetermined 
patterns. Such patterns may include, for example, time spent accessing information 
1 0 relating to the selection choice, time of day that information relating to the selection 
choice is accessed, and repeatability of the way in which the user accesses 
information relating to the selection choice. 

Preferably the method further comprises performing at least one of 
1 5 comparative analysis and/or cluster analysis between profiles of two or more users, 
and receiving output therefrom. This output is then used to provide at least one of 
the selection choices representative of the suggested change. Thus, for example, if 
interest A is linked with interest B and a user U1 has interest A, then interest B 
would be a suggested change. Other data mining and analysis methods could be used 
20 to identify potential interests, such as fuzzy logic, heuristic and knowledge based 
methods. 

Advantageously the method further comprises evaluating confidence values 
associated with the output from the comparative analysis and/or cluster analysis, and 
25 the step (i) of suggesting a change to the contents of the profile based on the said 
output is performed in dependence on the evaluated confidence values associated 
therewith. Such confidence values may, for example, be calculated using statistical 
techniques, in a manner known to one skilled in the art. 

30 . Conveniently the method further comprises comparing the output with a list 

of changes that are deemed not to be relevant to the user, and, if any of the 
selection choices in the output matches any on the list, discarding the matching 
selection choices from the output. Thus interest B would be compared with a list of 
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"non-interests", and only suggested to the user if it has not been entered on the 
"non-interests" list. 



Further aspects, features and advantages of the present invention will be 
5 apparent from the following description of preferred embodiments of the invention, 
which refers to the accompanying drawings, in which 

Figure 1 is a schematic diagram illustrating the physical, or transport, layer of a 
communications system according to the invention; 

Figure 2 is a schematic block diagram showing components of an information 
1 0 provider according to the present invention; 

Figure 3 is a schematic block diagram showing means for generating actions by 
a profiler forming part of the apparatus of the information provider shown in Figure 2; 

Figure 4 is a schematic block diagram showing external inputs to a context 
determining means forming part of the apparatus of the information provider shown in 
1 5 Figure 2; 

Figure 5a is a schematic flow diagram showing steps involved in determining a 
user's context according to the present invention; 

Figure 5b is a schematic block diagram showing further external inputs to the 
context determining means shown in Figure 4; 
20 Figure 6 is a schematic flow diagram showing steps carried out by second 

retrieving means forming part of the apparatus of the information provider shown in 
Figure 2; 

Figure 7 is a schematic block diagram showing the second retrieving means of 
the information provider of Figure 2 co-operating with a monitoring means component; 
25 and 

Figure 8 is a schematic diagram showing a Bayes net forming part of the 
monitoring means component. 

Overview 

30 Referring to Figure 1 , at the physical level the communications environment of 

a user U1 includes a cellular telephone T1 and a personal digital assistant T2 carried by 
the user; and a computer workstation T3 including a modem. The workstation T3 is 
most likely to be located some distance away from the user U1 . 
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The workstation T3 in this case comprises a local area network (LAN) server, 
connected to further terminals (only one shown T4) at different distances from the user. 
In communication with the various terminals are a number of different communications 
channels forming parts of different notional networks (although some or all may be 
5 commonly owned). The workstation T3 is also connected to database servers DB1, 
DB2, shown on the same LAN N3 as the terminal T3; it is understood that the database 
servers DB1, DB2 could be located on LANs that are connected to LAN N3 via one or 
more switches and/or routers (not shown). 

A public switched telephone network (PSTN) N1 is interconnected with an 

10 integrated services digital network (ISDN) N2 via a gateway G1 (e.g. a local or 
international switching centre), and is connected via an ISDN line L1 to terminal T3, and 
hence to local area network N3. A public land mobile network (PLMN) (e.g. a GSM - 
compatible digital cellular network) N4 is connected via a gateway G2 to the PSTN N1 
and ISDN N2. A base station B1 of the PLMN provides a Pico cell in the environment of 

15 the building within which the user U1 is located, and a base station B2 provides a cell 
within the same general area. Thus, the networks N1-N4 are capable of delivering data 
at different rates to the various terminal T1-T3: low speed data via the PLMN N4, 
higher speed data via the PSTN N1 , and yet higher speed data via the ISDN N2 or LAN 
N3. 

20 Information sources, held for example on servers 1S1-IS4, are distributed 

throughout the networks N1-N4. Only four servers are shown, however other 
distributions are envisaged. The user U1 wishes to receive pertinent information from 
any or all the information sources in the system. 

25 Overview of an system for providing information that utilises an embodiment of the 
invention 

The context of a user U1 at a moment in time largely determines the type 
and content of information that is of interest to the user U1 at that moment in time, 
e.g. if a user U1 is at work then the user U1 is likely to want to know about work 
30 related interests. If information is categorised and presented according to the user's 
context, it achieves three things: firstly the volume of information presented to the 
user U1 is reduced, secondly the user does not have to manually filter out 
information that is not relevant to his present context, and thirdly the user is not 
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distracted by information that is not relevant to his present context. It is therefore 
convenient to filter information as a function of a user's context in order to reduce 
the volume of information that is not relevant to the user. 

Referring to Figure 2, an embodiment of the invention, which is generally 
5 referred to as information provider 200, is located on terminal T3. The information 
provider 200 essentially comprises a profiler 201 for generating a profile comprising 
interests of a user, interfacing means 203 for interfacing with the user, Index 
generator 205 for indexing information sources stored on servers IS1 - IS4, and 
retrieval engine 207 for retrieving information in accordance with the context of a 
10 user. 

The components of the information provider 200 are described in greater 
detail below, but in overview, the profiler 201 receives interests information from the 
user, via interfacing means 203, and generates a personal profile, which is stored in 
profile database DB1, for the user. The interests comprising the profile are 
15 accompanied by attributes, e.g. home/away, work/play, which generally describe a 
context of the user. The Index generator 205, as mentioned above, indexes 
information sources stored on servers IS1-IS4. 

The retrieval engine 207 comprises context-determining means 209, first 
retrieving means 211 and second retrieving means 213. Context-determining means 
20 209 receives information from external sources relating to the user U1 - e.g. the 
location of the user, the time of day, and one or more inputs from a diary, from 
which the context of the user U1 is determined. The first retrieving means 211 is 
arranged to retrieve information from the personal profile generated by the profiler 
201, such that, once the context determining means 209 has determined the context 
25 of the user, the first retrieving means 211 can retrieve interests relating to that 
context. The second retrieving means 213 uses the context-related interests retrieved 
by the first retrieving means 211 to query the index generator 205 and retrieve 
information sources corresponding to those queries. 

Many people have concerns over the accessibility and manipulation of 
30 aspects of their user profile by parties with whom they do not have a trusted 
relationship. Moreover, it appears that many users do not want automatic profile 
adaptation, preferring instead to have full control of any profile changes. However, 
systems that are user intensive, such as systems where the user is required to make 
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changes to his profile, run contrary to the ideals of agent-based systems, where 
agents are developed to reduce the load on a user. 

Essentially the present invention, which concerns aspects of the profiler 201, 
is used to modify the content of a user profile in accordance with various external 
5 inputs and feedback from the user. Changes are suggested by the system, but are 
only effected with the permission of the user. This therefore saves the user from 
directly modifying the profile, but the changes are made solely in response to the 
permission of the user. Furthermore, changes are reviewed, so that if a change is 
found to be unacceptable, it can be modified and/or reversed. The reviewing of 

1 0 changes can occur over a timescale that is selectable by the user, so that the user 
can inform the system that she has not had time to test the changes, thereby re- 
setting the review time. 

Such an arrangement, for personalised information retrieval, strikes a balance 
between the two conflicting issues of load on the user and over-automated profile 

1 5 management. 

Profiler 201 

The information provider 200 has a predetermined, finite number of interests 
from which interests associated with a user U1 can be selected. These interests fall 
20 within the following non-exhaustive list of categories: 

• General interests: e.g. current affairs, finance, home, business, leisure, hobbies, 
clubs etc.; 

• Places: e.g. home town, holiday destination, business trip, family homes etc. 

There is a "master copy" of general interests, which are interconnected via 
25 connectors as an acyclic graph. The master copy is stored in a database table in the 
profile database DB1 . Each of the interests (of type general or location) has one or 
more predefined keywords that are associated with the interest, and these keywords 
are also stored within the profile database DB1, linked to their respective interest. 
(The keywords are used for retrieval of information relevant to the user, as described 
30 in more detail later). Each interest has a unique identifier (ID), and the connectors in 
the acyclic graph link interests together (so that the inclusion of keywords for an 
interest is at least partly dependent on which other interests are linked to that 
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interest such that, for example, interest .1 inherits a predetermined number of 
keywords from its parent and child interests). 

The profiler 201 is arranged to receive interest information from the user and 
to generate a profiie that comprises the user's interests and keywords relat.ng to 
5 those interests. The profiier 201 can also implicitly identify interests in accordance 

with predefined rules. 
Explicit modification of profile: 

A user can explicitly input words and/or phrases that are representafve of 
,h. user's interests, whereupon the pro.i.er 201 attempts to find interests within the 
,0 master copy the, match the input. This Involyes searching through the acyclic graph 
,or a match between the input and the interests, possibly using WORDNET, a lex,ca, 

an interest that matches the input, the user U1 can nayigate through th. mterests 

and select interests manually. . 
15 The profiler 201 can additionally co-operate with a parser (not shown) ,n 

order to transiate the entered keywords and/or phrases into one or more 
predetermined interests; many known systems could be adapted to parse input and 
translate the parsed input into a format understood by the profUer 201. 

,n addition, users can access their profile at all times. Thus a user .s able to 
20 view their profi.e, view their instantiated context, make changes direct.y to the.r 
profile, and indeed to the information sources used to retrieve information in respect 
of the interests in the profile. 

Implicit modification of profiie: 
25 The profiler 201 can include one or more rules that inter-relate interests, 

such that i. interest A is linked with interest B and a user U1 has interest A. then the 
profiler 201 will automatically include Interest B in the user's profile. Rules can also 
M based on information retrieyed by the profiler 201 from other users and/or externa, 
sources. The latter can be particularly uaefu. for proyiding servica-related Information 
30 to users - e.g. H one of the information sources »1 - .34 Includes informal 
relating to banks, the information proyider 200 haa the potential to deliver bank- 
related information to a user. However, in order to deliver bank-related informauon 
that is relevant to the user, the Information provider 200 needs to know wh,ch bank 
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the user U1 has an account with. Clearly the user U1 could explicitly enter this 
information, but it would be far more convenient to the user U1 if this information 
could be retrieved without having to involve the user U1 directly. To this end the 
profiler 201 can be further arranged to co-operate with web browsers (not shown), 
5 one located on each terminal T1, T2, T4 used by the user U1. In particular the 
profiler 201 could download software, for website tracking, on the hard disk drive of 
the terminal(s) T1, T2, T4. Assuming the user U1 banks "on-line", the URL of the 
user's bank could be accessible to the downloaded software, and from this the 
profiler 201 can retrieve the name of the bank, and add this to the user profile in the 
10 profile database DB1. Alternatively, if the profiler 201 has access to billing 
information in respect of third party services and/or systems used by the user, the 
profiler 201 can be arranged to request bank details from the billing system relating 
to the third party. The third party system could be located on one of the networks N1 
- N4. 

15 The user U1 can override any interests that have been inferred implicitly. 

The profiler 201 is arranged to tag a context type to each interest. In this 
embodiment, two categories of context are described, namely state and situation, 
although many other context categories are possible. In their simplest forms, these 
context categories have two types: state context has work and play, and situation 

20 context has home and away. For each category of context, some interests may be 
tagged with both types of contexts - e.g. user U1 may be interested in computers 
for Work and Play purposes. The context type can be entered explicitly or implicitly, 
and each interest can be tagged with a default context type. 
Storage of profile 

25 When an interest is selected for a user U1, the profiler 201 enters an 

identifier corresponding to the ID of that interest, together with the position of that 
interest in the tree hierarchy, into a database table for the user U1. The processes 
involved in storing data in, performing queries on and receiving data from, databases 
are known to those with ordinary skill in the art of database administration. For place 

30 names interests, e.g. place names in the UK, there is a "master copy" of place 
names, each of which is accompanied by county information, and an interest in 
respect of a place name is dynamically created using country/county/place retrieved 
from the master copy. 
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Interests can additionally be stored with attributes such as stock value and 
location. Setting the stock value attribute indicates that the user U1 is interested in 
stock values in respect of a particular interest (e.g. interest may be a company). 
Setting the location attribute indicates that the user U1 is interested in location 
5 information relating to an interest (e.g. interest may be banks, so that specifying a 
location attribute, such as Bognor Regis, refines the interest to banks in Bognor 
Regis). Other attributes include expertise, privacy, relevance weightings, priority 
weightings, times, days, volume of information, delivery channels, layout, quality of 
information sources, and bookmarks. There are default settings for each of the 
10 attributes, but the user U1 can change any of these settings via interfacing means 
203 (described below). 

The profiler 201 can also "push" questions out to the user U1 - e.g. to 
suggest interests to the user in accordance with predetermined rules. The 
15 predetermined rules include implicit profiling (e.g. via techniques such as cluster 
analysis, clickstream, collaborative filtering etc.), changes in the interest acyclic 
graph, information from the system administrator, and feedback from the user U1. 
The profiler 201 monitors the user's response to the questions, and modifies the 
rules in accordance with the response. 
20 Thus, referring to Figure 3, the profiler 201 preferably additionally comprises 

profile modifier 301 for receiving information from a variety of inputs 31 1, 313, 31 5/ 
317, 319 (e.g. output of collaborative filtering, changes to system interests etc., or 
feedback from the user U1) and creating actions 305 based on these inputs. The 
inputs may be accompanied by conditions such as time to push information to the 
25 user U1, priority status relating to type of interests etc. The profile modifier 301 has 
access to a predetermined list of functions F1...Fn, and, based on the nature of the 
input and corresponding conditions, selects one of the functions to create an action 
305. The functions stored in the list include "user interaction functions", which 
include the functionality to activate suggestions and questions to prompt the user U1 
30 with (qualified by question type), and "system functions" in respect of interests, 
which include the functionality to add new interests, delete referred interests, modify 
referred interests etc. The "user interaction functions" include pushing information to 
the user via interfacing means 203, and can be tagged with question type, including 
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"on trial", and "revert to original settings" etc., and delivered to the user in 
accordance with the priority of the request. The "system functions" include effecting 
a change to a stored user's profile. This further functionality of the profiler 201 is 
illustrated in the following example: 
5 An interest 11 is passed to the profile modifier 301 as a result of 

collaborative filtering of interests between users. The profile modifier 301 creates a 
suggestion action 305, the suggestion being that new interest 11 should be added to 
the user's profile, and, because the interest is a new interest, profile modifier 301 
assigns a condition of low priority to the suggestion. The suggestion action 305 is 
10 then pushed to the user U1. Typically a new interest will also be tagged with "on 
trial", such that the message pushed to the user U1 reads: 

"We think that you may be interested in adding a new interest 11 to your prof He. 

This wiil have the effect of returning stories like Story 1, Story 2, Story 3 

Wouid you like to add 11 to your profile? Please answer Yes, No, 111 try it for a week '' 
15 Profile modifier 301 is also operable to receive responses from the user U1. If 

the user selects the option "I'll try it for a week", a message is sent to the profile 
modifier 301, which identifies the type of feedback, stores the message and creates 
a timer condition in respect of an action to prompt for some user feedback. Once the 
timer has expired, profile modifier 301 generates a feedback action 305 for feedback, 
20 kills the timer and deletes the message from the profile modifier 301. The message 
pushed to the user reads: 

"Last week RF suggested the addition of interest J 1 to your prof He. 

Would you like to keep this change? 

Please answer Yes or No 
25 (If you answer No you will revert back to your original settings)" 

The feedback from the user U1 is received by profile modifier 301, which identifies 
the type of feedback. If the feedback includes making a change to the user profile, a 
system action 305 is generated based on the system functions and the profile is 
modified as described above. 
30 Alternatively, the profile modifier 301 could monitor the number of times the 

user accesses information relating to the suggested interest 11, and, if the number 
exceeds a threshold value, the profile modifier 301 could generate a system action 
305 of automatically adding the interest 11 to the profile. Additionally, the profile 
modifier 301 could identify the time that, and location at which, the user accesses 
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information relating to the interest 11, and could store this as context information for 
the interest 11. Then, when the interest 11 is added to the profile, the profile modifier 
301 additionally tags the interest with the stored context information. 

In addition, or as an alternative, to monitoring a number of times the user 
5 accesses information relating to the suggested interest 11, the profile modifier 301 
could be arranged to monitor the manner in which the user accesses information 
relating to the suggested interest 11 . The profile modifier 301 could then be arranged 
to add the interest 11 to the profile only if aspects of their access behaviour satisfy 
certain conditions. For example, a condition could be that a user spends a minimum 
10 of 2 minutes accessing information, or that the user accesses information relating to 
the interest at least five times each day for 3 days. Such access patterns could take 
many forms, as would be appreciated by one skilled in the art. 

Other inputs 319 include system administrator inputs - e.g. terminal format in 
which the user prefers to receive information. 
15 The profile modifier 301 further includes means for storing feedback from the 

user U1, in particular maintaining lists 310 of interests in which the user U1 is not 
interested (e.g. as a list of deleted interests). Thus whenever an input is received in 
respect of a new interest, profile modifier 301 compares the new interest with the 
list of deleted interests 310 (and does not create an action in respect of deleted 
20 interests). In addition to strict comparison between incoming inputs and deleted 
interests, the profile modifier 301 can access the lexicon WORDNET, or similar, in 
respect of the interests on the deleted list, comparing new requests with terms 
identified by WORDNET as being equivalent to deleted interests. 

In another embodiment, the profile modifier 301 only creates a user 
25 interaction function if it is confident that the interest will be relevant to the user. 
Thus, for each potential new interest, the profile modifier 301 evaluates a confidence 
value, and compares that with a threshold confidence value. If the confidence value 
is greater than the threshold confidence value, the profile modifier 301 generates a 
user interaction function (e.g. the suggestion action), as described above. This is 
30 shown in Figure 3, where a confidence value 321 is associated with the various 
inputs 311, 313, 315, 317, 319. 

If a new interest has been identified through statistical techniques such as 
clustering or filtering, confidence values can be evaluated from the data used in the 
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clustering. For example, if an interest has been identified using cluster analysis, the 
standard deviation of points within the cluster can be evaluated, and used to derive a 
confidence value associated with the cluster analysis. 

5 Interfacing means 203 

Interfacing means 203 controls the communication of information between 
the user U1 and the information provider 200. As shown in Figure 1, the user U1 
can, for example, communicate with the information provider 200 located on terminal 
T3 via any one of three Terminals T1 , 12, and T4. Two of the Terminals T1, T2 can 

10 be arranged to send and receive data using WML (Terminals T1 and T2 having a 
Bluetooth emitting and receiving device respectively so that information that is 
received via terminal T1 can be received on terminal T2), and the third terminal T4 
can be arranged to send and receive data using HTML. Each of the terminals T1, T2, 
T4 is capable of running web browsers. The user enters data via a form on the 

1 5 browser, in a known manner, in order to perform a number of actions, including (non- 
exhaustive list): 

• modifying interests comprising the user profile; 

• modifying the user's context; 

• viewing items pushed to the user; and 
20 • selecting items pushed to the user. 

This form essentially captures any parameters entered by a user and transfers the 
captured parameters to other parts of the information provider 200 stored on the 
Terminal T3. For further information see "Client/Server Programming with Java and 
Corba", 2 nd Edition, R. Orfali and D, Harkey, pp. 239 - 242. 

25 When the user U1 explicitly adds or deletes an interest from the user profile, 

the form posts the interests, together with the action that is to be performed in 
respect of that interest (e.g. add, delete), to the profiler 201, which processes this 
information as described above. The form can include drop down menus and the like 
to enable a user U1 to select from a predetermined list, which is especially useful for 

30 defining interests in films, food, etc. as the user U1 can select which film he is 
interested in, or type of food he is interested in. 
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Interfacing means 203 can alternatively or additionally be configured with 
means (not shown) to present at least part of the shortlist and/or interactions with 
the user U1 via a speech medium. 

5 Index generator 205 

Index generator 205 creates an index for each information source stored on 
information source servers IS1 - IS4. and stores the information in index tables on 
index database DB2. Each index table includes at least some of the following 
information: 

10 1 . The name of information source (e.g. name of a document). 

2. Key terms that have been extracted from the information source. 

3. The network address (URL) of the information source. 

4. A locality record for the information source 

5. Date and time that the information source was created. 

1 5 6. Context attribute of information source - i.e. whether relevant to Work, Play, 
Home or Away (optional). 

A locality record is a location defined in latitude, longitude and altitude, and a 
three-dimensional shape defining the locality in which the record is valid, with the 
defined location at the centre of three-dimensional shape. Localities with which 
20 respective information sources are associated are individually configurable. Some 
information (for example weather information) could therefore be readily accessible by 
the user over a relatively wide area, whereas other information, being more location- 
specific (such as local services information), could be filtered out for all areas except in 
a narrowly defined locality. 
25 The type of information indexed by Index generator 205 includes News data, 

stock market listings, Lycos, weather data, classified advertisements, event guides, 
entertainment information such as cinema listings, places to stay, places to eat, 
sport, financial information, shopping information e.g. store guides, and information 
on services e.g. cash point locations, petrol station locations. There are many 
30 methods for building indexes to unstructured information sources, and the particular 
method employed is unimportant for this embodiment. Thus, any means that is 
capable of retrieving and storing data in accordance with the index table defined 
above can be used (see "Managing Gigabytes" by Ian H. Witten, Alistair Moffat, 
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Timothy C. Bell Morgan Kaufmann; ISBN: 1558605703 for a comprehensive review 
of many methods currently employed in search and retrieval technology). The Index 
generator 205 accesses the information servers IS1 - IS4 at regular intervals so that 
the index tables are up to date (daily, hourly, weekly, depending on the nature of the 
5 information source and the rate at which it changes). 

Retrieval engine 207 

The function of retrieval engine 207 is to determine and instantiate the 
user's context in real-time, to retrieve interests associated with the instantiated 

10 context from the profiler 201 and to retrieve information from information sources 
IS1 - IS4 via Index generator 205 based on those interests. 

The first stage - in which the retrieval engine 207 determines and 
instantiates the user's context - is performed by context-determining means 209 and 
is described with reference to Figure 4. Context-determining means 209 includes a 

15 configurable list 400 defining each of the context types - e.g. HOME can be defined 
by a location - as latitude and longitude values, and AWAY can be defined as any 
location more than n1 Km from home. Similarly, WORK can be defined by time of day 
and location, and PLAY can be defined by time of day and any location more than n2 
Km from work. The Context-determining means 209 is arranged to receive input from 

20 several sources, e.g. a diary 401, a clock 403, and a personal tracking device 405, 
and to combine these inputs with information in the predetermined list so as to derive 
the context of the user U1 . The examples shown in Figure 4 are merely illustrative of 
the types of sources that can co-operate with Context-determining means 209 {more 
examples are given below). 

25 The clock 403 could be a system clock, located on the Terminal T3. The 

diary 401 could be Microsoft Outlook™ calendar, or a custom built diary, to which the 
Context-determining means 209 can submit queries at predetermined times, 
requesting time-tagged data in respect of the user's planned actions. Information 
relating to interfacing with Outlook™ can be found in "Programming Microsoft 

30 Outlook and Microsoft Exchange" by Thomas Rizzo, Microsoft Press; ISBN: 
0735605092. Entries in a diary 401 often include phrases, which require to be 
parsed into a format recognised by the Context-determining means 209; the diary 
401 can be configured to interact with a parser and translate the entries accordingly. 
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The persona, tracing device 405 la carried by the uaer U1 and enables his 
posidon within .he communions environment to be .racked. For example In , thrs 
embodiment the .racking device 406 comprises a chip carrying card or smart card 
oanying da.a identifying ,he use, and some or a„ of ft. .erminals T, -T6 canymg a card 
5 reader are arranged .oread, he card. Alternatively, h could comprise a smart badge 
transponder, .he ,oca.ion of which is .racked widrin .he building. Specie* .he 
workstations T1-T4 and .he celluler .elephone T, carry auch smart earn tea a . 
AddWona. em,,, card reader, are ins.a,led a. access doors wi,hin the bu„d,ng and am 
connected to the LAN N3 .o signal thereon. Further, preferably, the cal, phone 
10 compdsee. in addition ,o cel, phone communing components, a global p«mon,ng 
system .GPS, receiver end is arranged to derive and eigne, Hs poemon P-*"»- 
disolosed in EP 046766, (M o.oro,a,. A,.ema«va,y, .he cel, phone T, could erfonn 
poamoping addidonally using a land-baaed positioning signal, such aa by ^ferenti , GPS 
Positioning, or purely using land-baaed positioning aignals, such as drfterentia, GSM 

1 5 triangulation signals. 

Once the Context-determining means 209 has gathered informat.on from the 
diary 401, the Cock 403 and the tracking device 405, and other such devices (not 
shown), this information is combined in order to instantiate the user's context. As an 
example, this process is defined with reference to Figure 5a: 
20 . S 5.1 Take current time (t) and compare with time slots predef.ned for work 
(start time Ti. end time T*. nominally applies to weekdays): 
i |f (T, < t < T 2 ) and diary information work related: context WORK 
H. H (Ti < t < T 2 ) and diary information play related: context PLAY (e.g. day 
on leave) 

25 iii. If (t < Ti OR t > T.) and diary information work related: context WORK (e.g. 

working in evening) 

iv.lf (t < Ti OR t > Td and diary information play related: context PLAY (,.e. 
evening free) 

. S 5.2 Transform position data (.,, 1.) into latitude and longitude and compare w,th 
30 position for home {loci, I0C2): 

(i) If (loci < < loot + nD AND (loc, < «. < loc 2 + n1): context HOME 
if (|, > loci+n1) OR (I] > loc 2 + n1): context AWAY 
such that 



WO 02/41176 PCT/GB01/05049 



S 5.1 (i or iii) and S 5.2 (i) give WORK, HOME (W, H) as contexts 
S 5.1 (i or iii) and S 5.2 (ii) give WORK, AWAY (W, A) as contexts 
S 5.1 (ii or iv) and S 5.2 (i) give PLAY, HOME.(P, H) as contexts 
S 5.1 (ii or iv) and S 5.2 (ii) give PLAY, AWAY (P, A) as contexts 
5 In addition to instantiating the context of the user with respect to state and 

situation, additional context information can be determined, for example, "outside", 
"stressed", "engaged", etc. Inputs from the devices listed at points 1 - 7 below, 
some of which are shown in Figure 5b, may be used to identify this additional 
context information. Such additional context information can also be used to 
1 0 determine whether or not to push information to the user, as described in more detail 
below. The devices may include: 

1 . Information from the user: the user U1 can use a predetermined key, or sequence 
of keys, to indicate that he is distracted or on the move via terminal T1 ; 

2. Weather information; 

15 3. Light level detectors, such as Photovoltaic sensing devices available from Remote 
Measurement Systems 2633 Eastlake Ave E, Suite 20, Seattle WA 98102 (e.g. 
PVC-1 device) 

4. Thermocouple sensors, such as temperature indicators available from TC Ltd (TC 
Ltd, P.O. Box 130, UXBRIDGE UB8 2YS, England) - to help determine whether 

20 the user U1 is outside or inside; 

5. Input representative of stress levels - e.g. monitoring body temperature or 
interactions of the user with his terminal T1, T2, T4. Monitoring of user/terminal 
interaction is disclosed in the Applicant's published application WO00/25565 (IPD 
case ref A25709); 

25 6. Input representative of Noise levels - can be determined by analysing the audio 
signal received via the audio input on the terminal T1; 
7. Inputs from additional physical sensors, connected to the terminal T1 by a serial 
interface (not shown), including inputs from 

o a low acceleration detector 501 (e.g. ADXL202, which measures 
30 acceleration of a low force in two directions); 

o a high acceleration detector 503 (ACH0408-05 available from 
Measurement Specialities Incorporated (which can be referenced via 
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Universal Resource Locator (URL) http:// www , msillga nnm on the Mm ^ ) 
wh.ch measures acceleration of a high force in three directions- 

o a direction detector 505 (e.g. a compass, HMC2003, avai.ab.e from 
Honeywell (referenced via URL http:// www.ssechonevw e ...^ which 
g.ves an absolute measurement of orientation of the terminal T1 ); and 

o a direction and velocity detector 507 (using an ENC Piezoelectric Vibrating 
Gyroscope (part number S42E-2 which is sold under the registered 
trademark GYROSTAR, available from Murata manufacturing Company Ltd 
(URL http:/ /www.murata.com) ). The acceleration, direction and velocity 
detectors may be connected via a multiplexer 509 (MUX) to the context- 
determining means 213. 

The complex nature of the physical data makes the use of simple rules for 
determmmg the user's context from all of these inputs unreliable. Hidden Markov 
Mode,s (HMM), or a Kalman Filter, can be used to determine more accurately the 
1 5 user's context when the instantiation of context is based on a combination of inputs 
- 7. A good description of the mechanics and applications of Kalman Filters is 
provided by du Plessis, R.M., 1967; "Poor nan's elation of Kalman Filters or 
How I stopped worrying and learned to love matrix inversion". 

20 The second stage - in which the retrieval engine 207 retrieves interests 

associated with this instantiated context from the profi.er 201 - is performed by first 
retrieving means 21 1, which submits a request in respect of the instantiated context 
to the profi.er 201 . The profi.er 201 queries the profile database DB1, which retrieves 
mformation in accordance with the query, and returns interests relating to the 
■nstantiated context to the first retrieving means 21 1 . For examp.e, if the instantiated 
context is Work, Play, the database DB1 searches for interests tagged with W, P. 

The third stage - in which the retrieval engine 207 retrieves information 
relevant to the user's instantiated interests from information sources IS1 - IS4 - is 
30 performed by second retrieving means 2 1 3, as shown in Figure 6. 
Second retrieving means 213: 

• S 6.1 Receives the instantiated interest data from first retrieving means 211; 
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• S 6.2 Selects one of the index tables containing data relating to an information 
source, and checks for correspondence between instantiated interests and indexed 
keywords, e.g. using a key word match; 

• S 6.3 Retrieves network address (URL) and name of information source for positive 
5 key word matches; 

• S 6.4 Repeats S 6.2 and S 6.3 for all of the information sources that have been 
indexed by Index generator 205; 

• S 6.5 Compiles network address (URL) and name of the information sources that 
have been retrieved at S 6.3 into items on a shortlist. 

10 In an alternative embodiment, first retrieving means 211 retrieves all of the 

user's interests. Then, using the context attributes tagged to the interests, e.g. football 
P (Play), H (Play, Home); computers W, P (Work, Play), second retrieving means 213 
ranks the items on the shortlist in accordance with the user's instantiated context. The 
items on the shortlist created at S 6.4 are then organised in accordance with the 

15 rankings, so that items that are most relevant to the user's instantiated context are 
placed at the top of the shortlist. 

When a significant volume of data is indexed, e.g. around 2 Gigabytes or 
more of data, the capabilities of current hardware mean that the invention is likely to 
be operated in accordance with the first embodiment (i.e. retrieve data only in 

20 respect of instantiated interests). In this case, the second retrieving means 213 can 
be arranged to retrieve information relating to interests other than the instantiated 
interests at a later (predetermined) time. 

The shortlist can be stored in the database DB1, and shortlist page views are 
25 created (e.g. HTML/WML page views are created comprising shortlist data) for delivery 
to the user U1 via interfacing means 203. 

Interfacing means 203 can be arranged to receive the shortlist page views 
from the database DB1 at predetermined intervals, and to post that data for display 
on the user's Terminal T1, T2, T4. In one arrangement the shortlist page views are 
30 requested from the second retrieving means 213, and passed from the second 
retrieving means 213 to the interfacing means 203. As stated earlier, items that rank 
the highest are displayed at the top of the list. The user U1 can select any of the 
items on the shortlist, and this selection is posted to second retrieving means 213, 
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which sends a request for data to the URL in the item. Data returned from the URL is 
then displayed to the user U1. 



In some instances, the instantiated context of the user U1 may indicate that 
5 the user U1 is not disposed to receive information. For example, the user U1 could be 
travelling, in a meeting, tired, in a badly lit environment etc., in which case 
responding to the information is difficult. In these situations, the second retrieving 
means 213 can withhold sending information until the user's context is determined to 
be more suited to receiving information. The second retrieving means 213 is 
1 0 therefore arranged to receive inputs from the context-determining means 209 that are 
representative of the user's instantiated context, and compare these with 
predetermined information providing rules, such as 

IF (Context == tired) OR (Context == Away) OR (Context == Low light) OR (Context includes 
travelling) etc. 

1 5 THEN (Set a timer (X mins) to check for context after X mins) where X is predefined 
ELSE (Send information to user) 
Clearly other forms of rules are possible. 

Adaption of profile: 

20 The embodiment presented above describes ranking items on a user's 

shortlist in accordance with the user's instantiated context, so that items relating to 
instantiated interests are presented to the user U1 first. In addition to the context 
information, preference attributes such as relevance, importance, etc. (described 
above) are also factored into the ranking process. However, it may be that the user 

25 U1 habitually ignores information relating to a particular interest {this is particularly 
possible for interests that have been derived implicitly based on rules (see example 
below)). 

As shown in Figure 7, the information provider 200 can further include 
monitoring means 701 for observing the way in which the user U1 accesses items on 
30 the shortlist, and adapting attributes of the user's interests accordingly. Preferably 
the monitoring means 701 includes a Bayes net 703 and co-operates with the second 
retrieving means 213. Items on the shortlist are input to the Bayes net 703, shown in 
Figure 8, to compute a priority associated with interests corresponding to those 
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items. As known in the art, a Bayes net considers the causal relation of history 
data in order to modify the a-priori probability of the occurrence of an event. A 
discussion of Bayes nets is given in "Artificial Intelligence - A Modern 
Approach" by S. Russell and P. Norvig, Prentice Hall, USA. 
5 The Bayes net 703 is operable to adapt the preference attributes, based 

on how the user U1 is observed to deal with items on the shortlist. For 
example, items that are placed high up the shortlist suggest that the 
information source is relevant to the user. In practice, however, the user may 
choose to access information relating to that interest later in the day. If that is 
10 the case, the monitoring means 701 uses the Bayes net 703 to modify the 
preference attributes, such that the next time an item relating to the interest 
appears on the shortlist, the monitoring means 701 can anticipate how the user 
U1 will want to deal with it, and factor this into the ranking of the interest 
accordingly. 

15 In addition to implicitly adapting the user's profile as described above, 

the second retrieving means 213 can be further arranged to receive information 
from the profile modifier 301. As described above, profile modifier 301 can 
push questions out to the user U1, requesting explicit rating of information 
sources - e.g. the user U1 could be presented with ratings from -2 to +2, and 

20 asked to indicate how relevant a particular information source was to them. In 
general, if a user returns a high score, second retrieving means 213 lowers the 
threshold of the relevance preference attribute in respect of interests that 
relate to that information source; conversely, if the user returns a low score, 
second retrieving means 213 increases the relevance threshold. 

25 If a user continually assigns a low score to information sources that 

have been identified for an interest, this could indicate that the keywords 
associated with the interest need modifying. The system administrator would 
either perform modification of keywords explicitly, or the user could be 
prompted to suggest some keywords. The user's response to the modified 

30 keywords could be monitored by the feedback, providing a means of improving 
the quality of the keywords in respect of an interest. 
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Data storage 

The information provider 200 is capable of providing information for many 
users, and a shortlist page view is generated for each user, on a per-user basis, every 
5 time the user requests information (automatically or manually). This places significant 
,oads on various components of the information provider 200, which can be reduced 
by caching data. The objective is to create pages, which are to be viewed by users, 
on the cache once only (i.e. request data from its stored location once only), so that 
any subsequent request for the page can be delivered from the cache. This shortens 
10 the request-response chain, improves response time, and reduces network load. 
Several caching arrangements are used: 
/ The items on the shortlist for each user is cached. 

The items on the shortlist for each user U1 are preferably post-processed by 
second retrieving means 213 to remove duplicates, and, as described above, are 
15 ranked in accordance with the user's instantiated context prior to storing .n the 
database DB1 . Typically, there will be a considerable number of items on the 
shortlist, but only a selection can be displayed to the user at any one time due to 
terminal display constraints. Thus the items on the shortlist need to be cached ,n a 
content cache (not shown), which can be provided by storage on the database DB1, 
20 so that the post-processing described above does not have to be performed for each 
selection of items that is displayed to the user. 

2 The page views comprising items on the shortlist are cached. 

As described above, the page views comprising the shortlist data are 

25 typically created using WML/HTML (although other formats are possible, see later). 
These page views are retrieved using HTTP, which is a request-response type of 
protocol - in its simplest configuration a client application, here the retrieving means 
213, sends a request to a server, here the index generator 205, and then the index 
generator 205 responds to the request by retrieving data from the content cache and 

30 sending it to the second retrieving means 213. In this configuration (not shown), 
there is a cache as an intermediary, such that the retrieving means 213 makes ,ts 
request to the cache (e.g. a request for a page view showing the shortlist) instead of 
to the index generator 205. The cache then makes the request to the index generator 
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205 on behalf of the retrieving means 213. The index generator 205 replies to the 
cache, and then the cache relays this to the retrieving means 213 (which passes the 
information on to the interfacing means 203). Each page view should obey the HTTP 
1.1 caching rules (see "Illustrated guide to HTTP", Paul Hethmon, Manning 
5 Publications Co. or RFC 261 6 Hypertext Transfer Protocol HTTP/1 .1 . R. Fielding, J. 
Gettys, J. Mogul, H. Frystyk, L. Masinter, P. Leach, T. Berners-Lee. June 1999. 
(Format: TXT = 4223 17, PS = 5529857, PDF = 550558 bytes) (Obsoletes RFC2068) 
(Updated by RFC2817) (Status: DRAFT STANDARD)); in particular the following 
must be included in the response header of the page views: 
1 0 (i) Expiry header, with HTTP date; 

(ii) Cache-control: max-age = sec tag, stating lifetime of the content; 

(iii) Content-length response header (which also allows persistent connections) 

In one arrangement, the network address (URL, item 3 in the index table) is pre- 
appended with the user's session ID, and changes to the session ID, such as changes 

15 to the user profile, updates to information sources etc. are used to control operation 
of the cache (when a user's profile is modified (e.g. an interest is added) then the 
cached page views are out of date (because the content populating the page view is 
out of date); to prevent these old pages being sent to the user U1, the session ID 
changes). Thus once the session ID changes, the cache is triggered to request new 

20 data from the index generator 205 (which retrieves data from the content cache if 
the type 1 cache is being used). 

3. The results of the queries for all users are cached. 

It is possible that, where there are a large number of users, some of the 
25 interests will overlap - i.e. User U1 will have some interests that are identical to the 
interests of user U2 (even though the context attribute tagged to the interest may 
vary between users), and it would therefore be inefficient to perform the same query 
a number of times. 

Consider performing a query - in respect of interests 11. ..In - for user U1 (assume 
30 that this query has not been performed before). 

• In accordance with Figure 6, second retrieving means 213 retrieves (step S6.3) 
URL and information source names corresponding to information sources that are 
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relevant to the user's instantiated context. This is stored in a query cache (not 
shown). 

• Keywords relating to the interests 11 ...In are hashed, saved e.g. in a file or 
similar, and linked to the respective identifiers in the query cache. 

5 • When a new query arrives in respect of interests J1 ... Jn (e.g. for user U2), the 
corresponding keywords are compared with the hashed keywords to identify 
whether a query has already been processed in respect of those interests. 

• If so: 

o second retrieving means 213 does not have to perform a query in respect of 
10 interests J1 ... Jn, and 

o interests J1 ... Jn are linked to the identified items in the query cache. 

Queries could be monitored for their time of request and degree of 

commonality across the user community, so that queries in respect of requests that 

are determined to occur frequently, and within a particular time interval, could be 
1 5 automatically performed at a predetermined time. 

The following is a worked example that illustrates the functionality of the 
information provider 200: 

Assume that the user has a user profile including the following sets of interests: 
20 Mobile Internet (Work) 
WAP (Work, Play) 
Durlacher (Work) 

Motorised transport (Work, Home) 
Television listings (Home, Play) 
25 Birdwatching (Play) 

Eating out (expensive food) (Play) 
Football - Startreck United (Play) 
Tattingstone White Horse (Play, Home) 

30 At 08:00, while en route for work, the user U1 receives an SMS alert 

informing him of local traffic delays around Martlesham. The user ignores the report, 
continues to work and gets stuck in a traffic jam. Bored, the user U1 decides to look 
at the news available from the information provider 200 on his WAP phone. The user 
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scrolls through the shortlist delivered by second retrieving means 213 and marks 
those interests that he wants to read later. By default the user U1 was shown work- 
related interests, because context-determining means 209 had determined the user's 
context to be work. However, there are links to Leisure-related interests. 
5 The user U1 has a meeting scheduled with Durlacher for 09:00. The user U1 

has set a high preference value for information relating to the Durlacher interest, so 
the shortlist includes information relating to Durlacher at the top of the list. 
Fortunately a description of a news article relating to Durlacher appears on the 
shortlist, and the user U1 is able to view the article prior to his 09:00 meeting. 

10 Later in the day, while at his desk and a little bored, the user U1 starts 

thinking about his evening activities. The user U1 changes the context to the 
Play/Home context, and is presented with a shortlist of items including TV 
programmes for that evening, news about his football team, and notice of the next 
Bird watching convention. 

15 At the weekend, the user U1 visits Norwich. From the user's banking details, 

the profiler 201 has included an interest in locations of NatWest cash machines, and 
from the user's motorised transport interest, the profiler 201 has included an interest 
in locations of petrol stations. Both of these implied interests are appended with all 
contexts (so in this embodiment Work, Home, Play, Away). The context determining 

20 means 209 determines that the context of the user is Away and Play, and, based on 
the user's determined location by tracking device 405, the second retrieving means 
213 retrieves information detailing the location of expensive restaurants, NatWest 
cash points and petrol stations around Norwich. 

25 Additional Details and Modifications 
Context changes: 

In the above description, the context of the user U1 is determined by 
context-determining means 209, and this information is used to arrange the data on 
the shortlist. However, it may be that the user U1 wishes to view information relating 
30 to a context other than his present one. For example, if the user U1 is at work, he 
may wish to see news relating to his football team, Startreck United. The interest 
football is likely to be accompanied by the contexts Play, Home or Play, Away. 
Assuming the information provider 200 is arranged to retrieve information relating to 
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the user's instantiated context only, the user U1 has to explicitly change the context 
in order to view this information.' The form displayed on the browsers (on any of the 
terminals T1, T2, T4) thus includes means for entering a request for change of 
context - either by entering a time, from which a context will be derived, or by 
5 explicitly entering the new context. When this form is posted by interfacing means 
203, as described above, the new context information is sent to the first retrieving 
means 21 1, and inputs from the context-determining means 209 are disabled. 

Different types of contexts can also be included - e.g. friends/family, and all 
of these contexts can be categorised by weekday and weekend (as weekend leisure 
10 activities are likely to be different from week day leisure activities). 

Categorising types of users: 

When a user U1 first subscribes to the system, the form presented to the 
user U1 on his browser can include a plurality of drop down menus and lists 
1 5 comprising interests selectable by the user. As an alternative, or in addition, the form 
can include descriptions of types of users, and the user U1 can indicate which type 
of user he is. For example, there may be several boots buttons: 

• Slippers button 

• Stiletto button 

20 • Trendy trainers button 

• Walking boot button 

• Cowboy boot button etc.. 

Each of these buttons has a pre-configured list of interests appended thereto, and, 
upon selection of one of the buttons, all of the respective interests will be assigned 

25 to the user U1. 

As an additional or alternative use for the buttons, the user U1 is prompted 
to select which of these buttons best represents them. The profiler 201 can be 
arranged to maintain lists of users against type of button and perform some sort of 
collaborative filtering and implicit filtering of interest data between similar types of 

30 users. 



Format of data: 
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The information may be summarised and delivered, and optimised to, a 
variety of formats/devices. The formats included, but are not limited to, HTML, WML, 
l-mode (cHTML), VxML, XML, Postscript, PDF. The devices include but are not 
limited to PCs, mobile phones, PDAs, other communications devices, games 
5 consoles, set-top boxes, multi-media kiosks, wearable computers, FAX and printed 
hardcopy. 

The information may be delivered on demand or alerted to the user U1. The 
alerting may use a number of mechanisms including but not limited to email, Internet 
Push services, SMS, WAP Push services, Voice. 
1 0 The indexing of data by Index generator 205 includes both local and remote 

sources held in formats including, but not limited to, XML, HTML, WML, CHTML, 
plain text, RTF and speech. 

The networks over which data is transmitted further includes UMTS, GPRS 
packet switched mobile networks. 

15 

Example of implicit adaption of profile: traffic information 

Assume that the user U1 has explicitly specified an interest in motorsport, transport 
and cars etc. for contexts Work and Home, and the profiler 201 includes a rule that 
states "If a user has an interest in motor vehicles, include an interest in traffic 

20 reports". In accordance with the description above, the user's profile will be modified 
to include the interest traffic. Assume further that the information provider 200 is 
arranged to automatically send the user U1 data at 08:30 every day. The context 
determining means 209 determines that the user is on his way to work (inputs from 
the tracking device 405 indicate that the user is moving, and that the movement is 

25 taking the user UT closer to work and further from home; inputs from the diary and 
clock 401, 403 confirm that the user is on his way to work). The first retrieving 
means 211 retrieves information from the user's profile relating to his instantiated 
context, and passes this information to the second retrieving means 213, which 
compiles a shortlist of information sources relevant to the user's interests, and sends 

30 the shortlist to retrieval engine 701 and to the user's terminal T1. As the traffic 
interest is relevant to the user's current context, the ranking associated with the 
traffic interest is likely to be high, and thus the interest appears relatively high up the 
shortlist. However, assume that the user chooses to ignore the traffic interest: the 
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Bayes net 703 modifies the preference weight associated with traffic, such that the 
next day (assuming the user is determined to be in same context) this modified 
preference weight will be applied and the interest is likely to appear further down the 
list (the actual position depending on the degree to which the weight is modified). 
5 The degree to which the preference weight is modified can depend on the 

number of days an interest is ignored, so that some sort of time dependent function 
is applied to calculate the weight. 

Aspects of the information provider 200, in particular the profile information 
stored on database DB1, may be operated behind a firewall, so that external parties 
cannot gain access to profile information. Alternatively the whole system could be 
run from within an intranet, or a secure network environment, such that the only 
communication with external systems is to receive updated information sources, for 
indexing purposes, and to communicate with a mobile user. 

As will be understood by those skilled in the art, the invention described 
above may be embodied in one or more computer programs. These programmes 
can be contained on various transmission and/or storage mediums such as a 
floppy disc, CD-ROM, or magnetic tape so that the programmes can be loaded 
onto one or more general purpose computers or could be downloaded over a 
computer network using a suitable transmission medium. 

Unless the context clearly requires otherwise, throughout the description 
10 and the claims, the words "comprise", "comprising" and the like are to be 
construed in an inclusive as opposed to an exclusive or exhaustive sense; that is 
to say, in the sense of "including, but not limited to". The words "X including Y" 
and the like are to be construed as indicating that X has at least Y, but that X 
most likely has other elements in addition to Y. 
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CLAIMS 

1 . A method of changing a profile representing subject matter of interest to a user, 
the profile including a plurality of sets of subject-related data, the method 
5 including the steps of 

(i) suggesting a change to contents of the profile to the user, which suggestion 
includes one or more selection choices representative of the suggested change, 

(ii) receiving one or more selection choices from the user, 

(iii) modifying the user profile in accordance with the or each selection choice, 
10 (iv) monitoring user action in respect of the modification, and, if the user action 

satisfies predetermined criteria, 
(v) permanently changing the profile in accordance with the or each selection 
choice. 

15 2. A method according to claim 1, in which the monitoring step (iv) includes the 
steps of 

presenting the user with at least one further selection choice, 
receiving the or each further selection choice, and 

comparing the or each further selection choice with the predetermined 

20 criteria. 

3. A method according to claim 2, in which the or each further selection choice 
comprises affirmation of the one or more selection choices representative of the 
suggested change suggested in step (i). 

25 

4. A method according to any one of the preceding claims, in which the monitoring 
step (iv) includes the steps of counting a number of times the user accesses 
information relating to the selection choice, and comparing the said number with 
a threshold number. 

30 

5. A method according to any one of the preceding claims, in which the monitoring 
step (iv) includes the steps of observing patterns of user access relating to the 
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selection choice, and comparing the observed pattern with one or more 
predetermined patterns. 

6. A method according to any one of the preceding claims, in which the monitoring 
5 step (iv) occurs with a frequency that is configurable by the user. 

7. A method according to any one of the preceding claims, further comprising 
performing at least one of comparative analysis and/or cluster analysis between 
profiles of two or more users, and receiving output therefrom, which output 

10 provides at least one of the selection choices representative of the suggested 
change. 

8. A method according to claim 7, further comprising evaluating confidence values 
associated with the output from the comparative analysis and/or cluster analysis, 

15 and in which the step (i) of suggesting a change to the contents of the profile 
based on the said output is performed in dependence on the evaluated confidence 
values associated therewith. 

9. A method according to claim 7 or claim 8, further comprising comparing the 
20 output with a list of changes that are deemed not to be relevant to the user, and, 

if any of the selection choices in the output matches any on the list, discarding 
the matching selection choices from the output. 

10. A computer program, or a suite of computer programs, comprising a set of 
25 instructions to cause a computer, or a suite of computers, to perform the method 

steps according to any one of claims 1 to 9. 
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